Re-run this search with the SEG filter switched off
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= orf19.6103
(173 letters)
Database: Seq/AA.fsa
85,676 sequences; 40,655,052 total letters
Searching..................................................done
Score E Sequences producing significant alignments: (bits) Value
orf19.6103 Chr1 (15595..16116) [522 bp, 173 aa] Predicted ORF in... 361 e-129 CAWG_01357 c1 complement(3181006..3181443) [438 bp, 145 aa] 304 e-107 CD36_00100 Chr1 (27505..27960) [456 bp, 151 aa] similar to AA s... 236 3e-80 CTRG_04282 c6 (3698..4168) [471 bp, 156 aa] 204 1e-67 SPAPADRAFT_53067 c1 (17822..18358) [537 bp, 178 aa] 183 3e-59 PICST_28175 Chr1 (546416..546940) [525 bp, 174 aa] Sporulation-s... 160 4e-50 DEHA2A13332g Chr1 (1120931..1121413) [483 bp, 160 aa] similar to... 156 1e-48 CANTEDRAFT_114386 c18 complement(222999..223517) [519 bp, 172 aa] 148 2e-45 CLUG_03452 c4 (541878..542453) [576 bp, 191 aa] 135 3e-40 PGUG_01091 c1 (1905490..1905888) [399 bp, 132 aa] 108 3e-30 YER046W Chr5 (243179..243610) [432 bp, 143 aa] Meiosis-specific ... 65 2e-13 CPAR2_212530 Chr2 (2698368..2699561) [1194 bp, 397 aa] Hap43p-re... 28 6.7 PICST_30796 Chr3 (958579..959352) [774 bp, 257 aa] predicted pro... 28 9.2
>orf19.6103 Chr1 (15595..16116) [522 bp, 173 aa] Predicted ORF in Assemblies
19, 20 and 21
Length = 173
Score = 361 bits (926), Expect = e-129, Method: Compositional matrix adjust. Identities = 173/173 (100%), Positives = 173/173 (100%)
Query: 1 MAGKESCETERKKKIPSIFLRCKKSGVVMSSTEDELVRQLAQHTDLSSLASFNSSIHPYQ 60
MAGKESCETERKKKIPSIFLRCKKSGVVMSSTEDELVRQLAQHTDLSSLASFNSSIHPYQ
Sbjct: 1 MAGKESCETERKKKIPSIFLRCKKSGVVMSSTEDELVRQLAQHTDLSSLASFNSSIHPYQ 60
Query: 61 FDFIIEHERGIKLFGIPLFSHKSLWPIIDPSHYQSINGKKLSIPISLENYPLPDFDWQWQ 120
FDFIIEHERGIKLFGIPLFSHKSLWPIIDPSHYQSINGKKLSIPISLENYPLPDFDWQWQ
Sbjct: 61 FDFIIEHERGIKLFGIPLFSHKSLWPIIDPSHYQSINGKKLSIPISLENYPLPDFDWQWQ 120
Query: 121 WDRWYVFMFNDVDPHGWMYSNVFFQCAKWKGKYYFGNTVRKRVWIRLRKKCSP 173
WDRWYVFMFNDVDPHGWMYSNVFFQCAKWKGKYYFGNTVRKRVWIRLRKKCSP
Sbjct: 121 WDRWYVFMFNDVDPHGWMYSNVFFQCAKWKGKYYFGNTVRKRVWIRLRKKCSP 173
>CAWG_01357 c1 complement(3181006..3181443) [438 bp, 145 aa]
Length = 145
Score = 304 bits (779), Expect = e-107, Method: Compositional matrix adjust. Identities = 144/145 (99%), Positives = 145/145 (100%)
Query: 29 MSSTEDELVRQLAQHTDLSSLASFNSSIHPYQFDFIIEHERGIKLFGIPLFSHKSLWPII 88
MSSTEDELVRQLAQHTDLSSLASFNSSIHPYQFDFIIEHERGIKLFGIPLFSHKSLWPII
Sbjct: 1 MSSTEDELVRQLAQHTDLSSLASFNSSIHPYQFDFIIEHERGIKLFGIPLFSHKSLWPII 60
Query: 89 DPSHYQSINGKKLSIPISLENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQCAK 148
DPSHYQSINGKKLSIPISLENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQCAK
Sbjct: 61 DPSHYQSINGKKLSIPISLENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQCAK 120
Query: 149 WKGKYYFGNTVRKRVWIRLRKKCSP 173
WKGKYYFGNTVRKR+WIRLRKKCSP
Sbjct: 121 WKGKYYFGNTVRKRIWIRLRKKCSP 145
>CD36_00100 Chr1 (27505..27960) [456 bp, 151 aa] similar to AA
sequence:UniProt:Q6BY10
Length = 151
Score = 236 bits (601), Expect = 3e-80, Method: Compositional matrix adjust. Identities = 109/148 (73%), Positives = 125/148 (84%), Gaps = 7/148 (4%)
Query: 33 EDELVRQLAQHT-----DLSSLASFNSSIH--PYQFDFIIEHERGIKLFGIPLFSHKSLW 85
ED+LV QL++H D +++ SFN SI PYQFDFI+E+ERGIKLFGIPLFSHKSLW
Sbjct: 4 EDQLVHQLSEHLRTHSDDFTAITSFNESISHTPYQFDFIVENERGIKLFGIPLFSHKSLW 63
Query: 86 PIIDPSHYQSINGKKLSIPISLENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQ 145
PIIDPSHYQ+I LSIPISLENYPLPDF+W+W W WYVFM+ND+DPHGWMYS VFFQ
Sbjct: 64 PIIDPSHYQTITNHHLSIPISLENYPLPDFNWEWVWSHWYVFMYNDIDPHGWMYSTVFFQ 123
Query: 146 CAKWKGKYYFGNTVRKRVWIRLRKKCSP 173
C KWKGKYYFGNTVR+RVW+RLR+KC P
Sbjct: 124 CKKWKGKYYFGNTVRQRVWLRLRRKCYP 151
>CTRG_04282 c6 (3698..4168) [471 bp, 156 aa]
Length = 156
Score = 204 bits (519), Expect = 1e-67, Method: Compositional matrix adjust. Identities = 98/144 (68%), Positives = 117/144 (81%), Gaps = 4/144 (2%)
Query: 31 STEDELVRQLAQHTDLSSLASFNSSIH--PYQFDFIIEHERGIKLFGIPLFSHKSLWPII 88
S + +L R + D FN SI PYQFDFIIE+ERGIK+FGIPLFSH+SL PII
Sbjct: 12 SLKVQLARPGSLVEDFRVFKQFNDSISTTPYQFDFIIENERGIKIFGIPLFSHRSLLPII 71
Query: 89 DPSHYQSINGKKLSIPIS-LENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQCA 147
DPS+YQ+INGK+L++P+S LENYPLPD +W+W+W WYV+M+ DVDPHGWMYSN+FFQC
Sbjct: 72 DPSNYQNINGKRLTVPLSKLENYPLPDSNWEWEWSHWYVYMYKDVDPHGWMYSNLFFQCD 131
Query: 148 K-WKGKYYFGNTVRKRVWIRLRKK 170
WKGKYYFGNT+RKRVWIRLRKK
Sbjct: 132 NGWKGKYYFGNTIRKRVWIRLRKK 155
>SPAPADRAFT_53067 c1 (17822..18358) [537 bp, 178 aa]
Length = 178
Score = 183 bits (465), Expect = 3e-59, Method: Compositional matrix adjust. Identities = 91/165 (55%), Positives = 116/165 (70%), Gaps = 8/165 (4%)
Query: 16 PSIFLRCKKSGVVMSSTEDELVRQLAQHT--DLSSLASFNSSIH---PYQFDFIIEHERG 70
PSI S ++ D + R DLS FN +I PYQFDFIIE+ERG
Sbjct: 8 PSIPQHESDSVALIQQLNDHIQRSTTSSLSHDLSIFTEFNQTISKTTPYQFDFIIENERG 67
Query: 71 IKLFGIPLFSHKSLWPIIDPSHYQSINGKKLSIPIS-LENYPLPDFD-WQWQWDRWYVFM 128
IKLFGIPL+S KSL PIIDP +QSI L+IP+S + NYPLPD++ W+W WD+WYVFM
Sbjct: 68 IKLFGIPLYSQKSLLPIIDPKQFQSITKTSLNIPLSNIGNYPLPDYNQWEWTWDQWYVFM 127
Query: 129 FNDVDPHGWMYSNVFFQCAK-WKGKYYFGNTVRKRVWIRLRKKCS 172
+ DVDPHGW+YS +FFQ K W+GKYYFGN+VR+R+WIR+R++ +
Sbjct: 128 YKDVDPHGWIYSTMFFQSDKRWRGKYYFGNSVRRRIWIRMRQRIA 172
>PICST_28175 Chr1 (546416..546940) [525 bp, 174 aa] Sporulation-specific protein
73
Length = 174
Score = 160 bits (404), Expect = 4e-50, Method: Compositional matrix adjust. Identities = 78/146 (53%), Positives = 100/146 (68%), Gaps = 7/146 (4%)
Query: 33 EDELVRQLAQHTDLSSLASFNSSI--HPYQFDFIIEHERGIKLFGIPLFSHKSLWPIIDP 90
E + + + ++ L+ L+ FN I QFDF++E++RG+KLFGIPL+S SL P+IDP
Sbjct: 18 ETQHFKDSSLNSTLAILSLFNDDIIATDIQFDFVLENQRGMKLFGIPLYSKNSLLPLIDP 77
Query: 91 SHYQSINGKKLSIPIS-LENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQC--- 146
S YQSI GK+L I L N+PLPDF W W WD WYV M NDVD GW+YSN+FF
Sbjct: 78 STYQSIKGKRLLISADHLNNFPLPDFSWTWSWDSWYVLMCNDVDDQGWVYSNLFFNNYFT 137
Query: 147 -AKWKGKYYFGNTVRKRVWIRLRKKC 171
WKGKYY GN VR+R+W+R+RKK
Sbjct: 138 DRTWKGKYYLGNFVRRRIWVRMRKKS 163
>DEHA2A13332g Chr1 (1120931..1121413) [483 bp, 160 aa] similar to CA3855|IPF4645
Candida albicans IPF4645
Length = 160
Score = 156 bits (394), Expect = 1e-48, Method: Compositional matrix adjust. Identities = 70/143 (48%), Positives = 97/143 (67%), Gaps = 7/143 (4%)
Query: 37 VRQLAQHTD------LSSLASFNSSIHPYQFDFIIEHERGIKLFGIPLFSHKSLWPIIDP 90
+R L +H + L +L S+ +FDFI+E++RG K FGIPLFS+++L P IDP
Sbjct: 17 LRDLLKHANVKSGRLLKALDSYEYQNDNVRFDFIVENQRGTKFFGIPLFSNRTLLPFIDP 76
Query: 91 SHYQSINGKKLSIPIS-LENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQCAKW 149
YQ ++GK +S+ + + NYPLPD DW+W WD+WY+ M NDVD GW+YS + F W
Sbjct: 77 QTYQLLDGKTISLTYNNINNYPLPDMDWKWSWDKWYLLMANDVDAQGWLYSVIMFNSTHW 136
Query: 150 KGKYYFGNTVRKRVWIRLRKKCS 172
KGKY FGN VR+R+W+R+R K S
Sbjct: 137 KGKYLFGNFVRRRIWVRMRHKLS 159
>CANTEDRAFT_114386 c18 complement(222999..223517) [519 bp, 172 aa]
Length = 172
Score = 148 bits (373), Expect = 2e-45, Method: Compositional matrix adjust. Identities = 65/127 (51%), Positives = 90/127 (70%), Gaps = 1/127 (0%)
Query: 46 LSSLASFNSSIHPYQFDFIIEHERGIKLFGIPLFSHKSLWPIIDPSHYQSINGKKLS-IP 104
L+S+ N + Y+ + ++E++RG+K+FGIP+FS SL P++DPS +Q +NG+ L I
Sbjct: 40 LNSMFVTNDDGYEYKLELVVENQRGLKIFGIPMFSSNSLIPMLDPSSFQLLNGQNLKVID 99
Query: 105 ISLENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQCAKWKGKYYFGNTVRKRVW 164
+L N PLPD +WQW W+ WYV M NDVD GW+YS + F WKGKYYFGN +R+RVW
Sbjct: 100 NNLNNLPLPDLNWQWHWECWYVLMLNDVDDQGWIYSKINFNSHHWKGKYYFGNFIRRRVW 159
Query: 165 IRLRKKC 171
IRLR +
Sbjct: 160 IRLRHRS 166
>CLUG_03452 c4 (541878..542453) [576 bp, 191 aa]
Length = 191
Score = 135 bits (341), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 61/131 (46%), Positives = 87/131 (66%), Gaps = 8/131 (6%)
Query: 48 SLASFNSSIHP-------YQFDFIIEHERGIKLFGIPLFSHKSLWPIIDPSHYQSINGKK 100
++ F+ I P + FD ++E++RG+K+ GIPLFS +SL P IDP YQ ++G
Sbjct: 16 NVGGFDKMIQPPLLRSLDFHFDVVLENQRGLKILGIPLFSRQSLMPCIDPPPYQHLDGLP 75
Query: 101 LSIPI-SLENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQCAKWKGKYYFGNTV 159
+ + S++NY LPDF W W WD W+V M NDVD GW Y +F + KW GKYYFG+ V
Sbjct: 76 VLLAYDSIDNYALPDFGWSWLWDTWHVLMLNDVDELGWAYLLLFLRSLKWHGKYYFGDFV 135
Query: 160 RKRVWIRLRKK 170
R+R+WIR+R++
Sbjct: 136 RRRMWIRMRQR 146
>PGUG_01091 c1 (1905490..1905888) [399 bp, 132 aa]
Length = 132
Score = 108 bits (270), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 47/97 (48%), Positives = 65/97 (67%), Gaps = 5/97 (5%)
Query: 62 DFIIEHERGIKLFGIPLFSHKSLWPIIDPSHYQSINGKKLSIPI-SLENYPLPDFDWQWQ 120
D+I+E++RG K FGIPL+S +SL P DPS ++++ G+KL + S++NYPLPD DW W
Sbjct: 6 DYIVEYQRGFKFFGIPLYSQRSLIPFADPSEFETLGGRKLLLSYGSIQNYPLPDLDWNWD 65
Query: 121 WDRWYVFMFNDVDPHGWMYSNVFFQCAKWKGKYYFGN 157
W+RWYV M + VD GWMY+ + W KY G
Sbjct: 66 WERWYVLMTDSVDDQGWMYAG----WSGWSSKYRLGT 98
>YER046W Chr5 (243179..243610) [432 bp, 143 aa] Meiosis-specific protein of
unknown function, required for spore wall formation
during sporulation; dispensible for both nuclear
divisions during meiosis
Length = 143
Score = 65.1 bits (157), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 47/125 (37%), Positives = 64/125 (51%), Gaps = 24/125 (19%)
Query: 62 DFIIEHERGIKLFGIPLFSHKSLWPIIDPSHYQSINGK--------KLSIPISLENYPLP 113
D +IE+ERGI L G PLFS K L P +DP +Q +N + K +I +E YP+
Sbjct: 16 DVLIENERGITLLGYPLFSPKILLPHVDPPQFQRLNTENGSLIALSKNTISNFIELYPI- 74
Query: 114 DFDWQW-------QWDRWYVFMF----NDVDPHGWMYSNVFFQCAKWKGKYYFGNTVRKR 162
D + Q +W+V M D+D GW YS F ++WK K VR+R
Sbjct: 75 DLSTERTAGSSSSQMTKWFVLMDYKEKYDIDDQGWCYS-WNFNNSRWKSK---NGLVRRR 130
Query: 163 VWIRL 167
VW+RL
Sbjct: 131 VWVRL 135
>CPAR2_212530 Chr2 (2698368..2699561) [1194 bp, 397 aa] Hap43p-repressed gene in
C. albicans
Length = 397
Score = 28.5 bits (62), Expect = 6.7, Method: Compositional matrix adjust. Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 17/76 (22%)
Query: 103 IPISLENYPLPDFD-------WQWQWDRWYV-FMFNDV-DPHGWMYSNVFFQCAKWKGKY 153
I I EN+P P F+ W+W D W + FN GW+Y + +KW+
Sbjct: 312 ITIHKENFPFPIFENDIYSYQWEWAEDDWKIDTEFNKAKSADGWVYYD-----SKWEHPR 366
Query: 154 Y---FGNTVRKRVWIR 166
Y F R R W R
Sbjct: 367 YRDGFSKFTRSRKWTR 382
>PICST_30796 Chr3 (958579..959352) [774 bp, 257 aa] predicted protein
Length = 257
Score = 28.1 bits (61), Expect = 9.2, Method: Compositional matrix adjust. Identities = 17/75 (22%), Positives = 33/75 (44%), Gaps = 10/75 (13%)
Query: 107 LENYPLPDFDWQWQWDRWYVFMFNDVDPHGWMYSNVFFQCAKWKGK----------YYFG 156
LE+Y + ++ + W ++ F+F V P+G + + KW G ++FG
Sbjct: 42 LEDYIIKNYGYFWFTLVYFAFLFGYVQPNGTIPWSKIVHTMKWYGANTVVFFLTQLWFFG 101
Query: 157 NTVRKRVWIRLRKKC 171
+ +R+ I C
Sbjct: 102 FLIFERINIATGGHC 116
Database: Seq/AA.fsa
Posted date: Feb 8, 2013 12:45 PM
Number of letters in database: 40,655,052
Number of sequences in database: 85,676
Lambda K H
0.323 0.138 0.466
Gapped Lambda K H 0.267 0.0410 0.140
Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 85676 Number of Hits to DB: 15,576,075 Number of extensions: 696328 Number of successful extensions: 1482 Number of sequences better than 10.0: 24 Number of HSP's gapped: 1467 Number of HSP's successfully gapped: 24 Length of query: 173 Length of database: 40,655,052 Length adjustment: 100 Effective length of query: 73 Effective length of database: 32,087,452 Effective search space: 2342383996 Effective search space used: 2342383996 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 61 (28.1 bits)